Picture for Zhao Zhong

Zhao Zhong

DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching

Add code
Feb 05, 2026
Viaarxiv icon

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Add code
Jan 27, 2026
Viaarxiv icon

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation

Add code
Aug 23, 2025
Viaarxiv icon

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Add code
Jul 29, 2025
Viaarxiv icon

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Add code
Jul 29, 2025
Viaarxiv icon

PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion

Add code
Dec 29, 2023
Figure 1 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 2 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 3 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Figure 4 for PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Viaarxiv icon

Learning Low-Rank Representations for Model Compression

Add code
Nov 21, 2022
Figure 1 for Learning Low-Rank Representations for Model Compression
Figure 2 for Learning Low-Rank Representations for Model Compression
Figure 3 for Learning Low-Rank Representations for Model Compression
Figure 4 for Learning Low-Rank Representations for Model Compression
Viaarxiv icon

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

Add code
Nov 17, 2022
Figure 1 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Figure 2 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Figure 3 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Figure 4 for EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones
Viaarxiv icon

Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs

Add code
Jul 08, 2021
Figure 1 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Figure 2 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Figure 3 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Figure 4 for Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs
Viaarxiv icon

Learning specialized activation functions with the Piecewise Linear Unit

Add code
Apr 08, 2021
Figure 1 for Learning specialized activation functions with the Piecewise Linear Unit
Figure 2 for Learning specialized activation functions with the Piecewise Linear Unit
Figure 3 for Learning specialized activation functions with the Piecewise Linear Unit
Figure 4 for Learning specialized activation functions with the Piecewise Linear Unit
Viaarxiv icon